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RELATED APPLICATIONS 



This application claims priority of U.S. Patent Application Serial No. 
09/175,195 (Attorney Docket No. CISCP048), naming Richard A. Johnson et al. as 
inventors, and entitled "Apparatus and Method for Performing Traffic Redirection in 
a Distributed System Using a Portion Metric." That application is incorporated herein 
10 by reference in its entirety and for all purposes. 



The present invention relates generally to traffic redirection in a distributed 
system. More particularly, the present invention relates to a method and apparatus for 
1 5 redirecting traffic among a number of servers using a portion metric for each server. 



A computer network may be defined as an interconnected collection of 
autonomous computers. In a distributed system, the existence of these multiple 
autonomous computers is transparent to the user. To achieve this transparency, 
20 allocation of jobs to processors and all other system functions must be automatic. 
These automated system functions are typically provided by an operating system. 

In general, the operating system hides the details of the hardware from the 
user and provides the user with a convenient interface for using the system. More 
particularly, the operating system is responsible for allocating resources v^thin the 
25 distributed system and schedules the execution of various services accordingly. Thus, 
the operating system selects the best processor, locates and transfers all corresponding 
appropriate location. In this manner, the operating system ensures that system 
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Field off the Inveimttioini 



Descrnptiom off the Related Ant 



resources such as file servers are used in an efficient manner. 

The resource allocation provided by the operating system includes the 
retrieval and processing of data. Often, this data is stored on one or more shared file 
servers. Users in such a system are called clients. Communication from a client 
generally comprises a request message asking for a particular service to performed. 
The service request message is then sent to an appropriate server. The server then 
does the work requested and sends back a reply. Thus, data is accessed and processed 
by the server in accordance with the service request message. 

In order to send a service request message to a server, the operating system 
must first select an appropriate server. Typically, the operating system selects a 
server according to criteria that may be applied through the use of a metric. By way 
of example, a commonly used metric distributes the service request message to a 
server closest in distance to the client. Accordingly, the operating system may direct 
traffic to a server according to a specified metric. 

Although various metrics exist for allocating resources within a network, these 
metrics do not adjust assignment of server requests in accordance v^th the load 
capacity of each server. By way of example, each server may have different 
processing capabilities. As yet another example, a server may be entirely unavailable. 
Metrics traditionally used in distributed systems do not adjust assignment of requests 
according to such situations. As a result, these metrics cannot adequately maximize 
the throughput of a distributed system having servers with heterogeneous load 
capabilities. It would be desirable if a metric were provided such that server requests 
could be distributed in accordance with the load capacity of each server. 

Every host and router on the Internet has an IP address. The Domain Name 
System (DNS) is often used to map host names to these IP addresses. By way of 
example, a client typically sends a DNS query to a DNS server which includes a host 
name and an indication that an IP address is requested. The DNS server then retums 
an IP address associated with the host name. It would therefore be desirable if a DNS 
server were designed to accomplish load distribution compatible with the Intemet and 



CISCP048C1/LVP/ERH 



Page 2 



DNS. 



In addition to ensuring adequate load distribution, it is necessary to 
accommodate for shifts of information. Information is commonly transferred from 
one web server to another web server on the Intemet. Typically, a web server will 
redirect a client to a new location due to this physical shift of information between 
web servers. However, the host of the client is typically not taken into consideration 
during selection of the web server in providing this redirect. In m£iny instances it 
would be desirable if the information could be selectively varied according to the 
particular client requesting the information. It would therefore be desirable if the 
host of the client were considered during selection of an appropriate web server in 
order to provide such a redirect. Moreover, it would be beneficial if the client could 
easily identify the server to which the server request is redirected. 

In view of the above, a system and method for redirecting traffic in a 
distributed system according to individual server capabilities would be desirable. 
Additionally, it would be beneficial if such a system distributed traffic in proportion 
to individual server portion metrics. 
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SUMMARY OF THE INVENTION 

The present invention provides a method and apparatus for distributing server 
requests among a plurality of servers in a distributed system. This is accomplished 
through assigning a portion metric to each server. In this manner, the portion metric 
allows capabilities of each candidate server to be taken into consideration during 
distribution of each server request. 

In accordance with one aspect of the present invention, each server request is 
distributed in accordance with the Domain Name System (DNS). Configuration is 
performed in several steps. First, a Domain Name System host name is associated 
with a plurality of servers, each one of the servers having a unique IP address. By 
way of example, a DNS table on a DNS server typically includes a plurality of 
entries, each of the entries containing an IP address-host name association. Second, a 
portion metric is then assigned to each one of the plurality of servers. The portion 
metric designates a portion of total server requests to be allocated to the one of the 
plvirality of servers. As a result, each web server is assigned a portion metric. Each 
portion metric designates a portion of total server requests to be allocated to the 
corresponding server. Once configuration is completed, server requests may be 
allocated. 

Each server request is separately allocated to a selected web server. The 
server request may include a DNS host name query received from a client. By way of 
example, a DNS host name query may include a host name to be looked up and an 
indication that an IP address is requested. A plurality of IP addresses associated with 
the host name may then be obtained, each one of the IP addresses being associated 
with one of the plurality of servers. A total number of server requests processed by 
the plurality of servers is incremented. In addition, a number of server requests 
distributed to each one of the plurality of servers is maintained. One of the plurality 
of servers is selected using the portion metric assigned to each one of the plurality of 
servers, the number of server requests distributed to each one of the plurality of 
servers, and the total number of server requests. An IP address associated with the 
selected server is then provided to the client from which the server request was 
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obtained. Accordingly, when each server request is received, one of the plurality of 
IP addresses is determined and provided to the client. In this manner, traffic sent to 
geographically distributed web servers in multiple networks may be distributed and 
monitored. Moreover, the present invention may be and is preferably geographically 
5 distant from the web servers. Thus, connections distributed to each web server may 
be monitored and load averages of each web server may be subsequently queried. 

In accordance with another aspect of the present invention, the portion metric 
is used in combination with other metrics. A portion metric is assigned to each one of 
a plurality of servers. The portion metric for each one of the plurality of servers is 

1 0 added to obtain a total portion metric. A number of server requests distributed to each 
one of the plurality of servers is maintained. In addition, a total number of server 
requests processed by the plurality of servers is incremented. A server request is then 
received. A server request percentage is computed for each one of the plurality of 
servers. The server request percentage for each one of the plurality of servers is a 

1 5 product of the number of server requests distributed to the one of the plurality of 
servers and the total portion metric divided by the total number of server requests 
received. In addition, a metric value is calculated for each one of the plurality of 
servers. The metric value for one of the plurality of servers is defined by the server 
request percentage for the one of the plurality of servers and the portion metric for the 

20 one of the plurality of servers. The metric values for the plurality of servers are 
compared to obtain a selected server. 

The selected server may include a set of servers. The set of servers may 
include those servers having an identical metric value. Alternatively, a tolerance 
range may be configured for the portion metric. The tolerance range may then be 
25 used in comparing the metric value for each of the servers. Thus, a set of the plurality 
of servers having a metric value within the tolerance range may be obtained. An 
altemate metric may then be applied to the set of servers to obtain a single selected 
server. By way of example, the altemate metric may include a distance metric. The 
selected server is then provided to the client. 

30 In accordance with yet another aspect of the invention, load imbalance may be 



CISCP048C1/LVP/ERH 



Page 5 




detected. As described above, a set of the plurality of servers is obtained upon 
application of the portion metric. When the set of the plurality of servers includes 
only one server, this may indicate the presence of load imbalance. By way of 
example, if there is a large disparity in the amount of server requests distributed to 
5 each server, there will be one server that has received the least amovmt of server 
requests, and this server will therefore be selected. A report may be generated 
indicating the presence of this load imbalance in the distributed system. Moreover, 
the detection of load imbalance permits reallocation of servers as well as the 
reconfiguration of portion metrics and tolerance ranges. 

10 In accordance with a further aspect of the invention, a portion metric is 

assigned to each server. An HTTP connection is accepted. A total number of server 
requests processed by the plurality of servers is incremented. In addition, a number of 
server requests distributed to each of the servers is maintained. One of the plurality of 
servers is then selected using the portion metric assigned to each of the servers, the 

1 5 number of server requests distributed to each of the servers, and the total number of 
server requests. An HTTP code redirect is then sent to the client. Therefore, 
information provided to a client may be selectively varied according to the particular 
client requesting the information. 

The advantages of assigning a portion metric to a set of servers and 
20 distributing a service request in accordance with such portion metric assignments are 
numerous. The present invention may be used to allow capabilities of each candidate 

server to be taken into consideration during distribution of each server request. By 
way of example, portion metric assignments may be used to maximize the throughput 
of a distributed system having servers with diverse processor speeds. Similarly, 
25 portion metrics may be reassigned upon a determination that a server is unavailable. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



FIG, 1 is a block diagram illustrating an exemplary distributed system in 
which the portion metric of the present invention may be implemented. 

FIG. 2 is a block diagram illustrating a distributed system in which the 
distributed director of the present invention is implemented. 

FIG. 3 is a flow diagram illustrating a method for operating the distributed 
director in DNS mode according to one embodiment of the present invention. 

FIG. 4 is a flow diagram illustrating a method for calculating the server 
request percentage for one of the plurality of servers according to an embodiment of 
the present invention. 

FIG. 5 is a flow diagram illustrating a method for determining the metric 
value for one of the plurality of servers according to one embodiment of the present 
invention. 

FIG. 6 is a flow diagram illustrating a method for operating the distributed 
director in HTTP redirect mode according to one embodiment of the invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention provides a method and system for redirecting traffic 
among a nimiber of servers. This is accomplished through ascertaining the host for 
the client, determining a plurality of servers associated with the host, and selecting 

5 one of the plurality of servers. The selection may be based upon defined portion 

metrics as well as other metrics. Various embodiments of the invention are possible. 
By way of example, the present invention may be implemented in accordance with 
the Domain Name System as well as the HTTP protocol. In addition, the invention 
can be implemented in numerous ways, including as a method, an apparatus such as a 

10 SMdtching element (e.g., router or brouter), or a computer readable medium. Several 
embodiments of the invention are discussed below. 

Referring first to FIG. 1, a block diagram illustrating an exemplary distributed 
system in which the portion metric of the present invention may be implemented is 
presented. A distributed system 10 will typically include one or more clients 12 and a 
15 plurality of servers. As illustrated, the plurality of servers includes a first server 14, a 
second server 16, and a third server 18. Typically, a client 12 will send a request for a 
service which will be distributed to one of the plurality of servers. To distribute each 
such service request, one or more metrics may be used in the selection process. In a 
commonly used metric, the service request is distributed to the server closest in 
20 distance to the client. The first server 14 is a first distance 20 firom the client 12, the 
second server 16 is a second distance 22 from the client 12, and the third server 18 is 
a third distance 24 from the client 12. As shown, the second distance 22 indicates 
that the second server 16 is closest in distance to the client 12. Accordingly, the 
service request would be distributed to the second server 16 if such a metric were 
25 applied. However, in circumstances where each one of the plurality of servers 14, 16, 
18 is equidistant from the client 12, another metric may be used as a "tie breaker." 
Moreover, the distance between the client and selected server may be deemed 
insignificant in comparison to factors such as the processor speed of each available 
server. In such instances, an alternate metric may be desirable. 

30 Referring next to FIG. 2, a block diagram of a distributed system in which the 
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distributed director of the present invention according to one embodiment is 
presented. A distributed system 70 in which the present invention may be 
implemented is illustrated. According to the present invention, a distributed director 
72 may be provided for each host. As shown, the distributed director 72 is provided 
5 for an Internet host 74. As will be described, a client 76 connecting to the Internet 74 
may wish to connect to a web server located in another geographical location. By 
way of example, the San Jose web server 78 and Paris web server 80 are shown. The 
distributed director 72 determines a host associated with the client 76 (e.g., 
patents.com). The distributed director 72 may be configured to be the primary DNS 
10 server for a particular host. Thus, once the host is determined, the IP addresses 
associated with this host may easily be obtained. In this manner, available web 
servers associated with these IP addresses are made available for selection. 

In response to a server request, the distributed director 72 utilizes one or more 
metrics to select one of these web servers. Metrics utilized by the distributed director 
1 5 72 may include a variety of metrics, including the distance metric. An information 
gathering block 82 may be utilized to gather metric information such as the distance 
to each server. The information gathering block 82 may include a router and 
therefore must have access to all routing tables relating to the geographically 
distributed Web servers. By way of example, the distributed director 72 may query 
20 the information gathering block 82 for distance metrics between the distributed 
servers 78, 80 and the client 76. According to one embodiment, the information 
gathering block 82 may include a Direct Response Protocol (DRP) agent, available 
from Cisco Technology, Inc., located in San Jose, CA. In addition, the distributed 
director 72 provides a portion metric which allows capabilities of each candidate 
25 server to be taken into consideration during distribution of each server request. The 
distributed director 72 may therefore direct clients to an appropriate server that is also 
closest in distance. In this manner, the distributed director 72 provides dynamic, 
transparent, and scalable Intemet traffic load distribution between multiple 
geographically dispersed servers. 

30 Referring next to FIG. 3 , a flow diagram illustrating a method for operating 

the distributed director in DNS mode according to one embodiment of the invention is 
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shown. As shown, a service request is distributed among a pluraHty of servers 26. 
First, at step 28, the distributed director is configured. Configuration may include 
associating each host with a plurality of IP addresses corresponding to a plurality of 
web servers in the distributed system, assigning a portion metric to each server, 
assigning other metrics to each server, and specifying a tolerance range for 
application of each specified metric. 

A portion metric PORTION_METRIC[X] is assigned to each one of the 
plurality of servers, where X = 1 to the total number of servers N. Each portion 
metric designates a portion of total server requests to be allocated to the 
corresponding one of the plurality of servers. For example, portion metrics 20 %, 32 
%, and 60 % may be assigned to three servers, respectively. The portion metrics do 
not necessarily need to total a value of one hundred. Thus, if percentages have been 
overassigned as in the previous example, then the percentage of server requests 
assigned to each server will automatically be scaled down by each server's portion 
metric such that they total 100% of requests and retain their relative configured 
portion of requests. 

According to one embodiment, a portion metric is assigned to each server in 
the distributed system using the DNS host name. For example, "HOST SERVERl 
PORTION 32" could be used to indicate that the portion of server requests for the 
DNS host "SERVERl" is 32. These portion metrics may then be used to determine a 
percentage of server requests to send to each server. Moreover, a defauh portion 
metric may be provided. By way of example, the default portion metric may be zero. 
As a result, servers with a higher portion metric will receive a larger number of server 
requests than servers with a lower portion metric. This allows a network 
administrator to adjust server load distribution across heterogeneous distributed 
servers, resulting in improved performance as seen by clients. 

The portion metric may be used in combination with other metrics or alone to 
distribute server requests in a network or distributed system to select an appropriate 
server for each client. Similarly, any of these metrics may be turned off or on to add 
or remove metrics which are to be considered. For example, a command "PORTION 
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X METRIC2..." may be provided to add the portion metric and a second metric, 
"METRIC2". Similarly, a command "NO METRICS" could be used to turn off a 
third metric, "METRICS". When the portion metric is used in combination with other 
metrics, the order in which these metrics are considered is specified. By way of 
5 example, the first metric specified in the command line may be considered first, 
followed by each following metric specified. 

In addition to specifying a portion metric as well as other metrics for each 
server, a tolerance range may be configured. The tolerance range supplies a range 
used to determine, relative to a lowest or highest value, whether any distributed 

10 servers should be equally preferred for a given client. For example, assume there are 
three servers: SERVERl, SERVER2 and SERVERS associated with values 100, 1 19 
and 125, respectively. If the tolerance value is set to 20 percent, SERVERl 
(associated with value 100) and SERVER2 (associated with value 119) would be 
equally preferred since 1 19 is within the 20 percent tolerance range. SERVERS 

1 5 (associated with value 1 25) would be eliminated from consideration. 

The present invention may be configured to function in a DNS mode as well 
as in HTTP redirect mode. Additional DNS resource records must be added to the 
primary domain's primary DNS server to identify the distributed director as the 
authoritative name server for a given host in DNS mode, or as the actual Web server 
20 requested by the end user in HTTP redirect mode. Moreover, DNS resource records 
used by the distributed director may be configured in an external server or, 
alternatively, in the distributed director. 

Next, at step 29, initialization of variables is performed. During this step, a 
^ variable storing a total number of server requests processed by the plurality of servers, 
25 TOTAL_REQUESTS, is initialized to a constant. In addition, a variable for each one 
of the plurality of servers storing a number of server requests distributed to each one 
of the plurality of servers, NUMBER_SERVER__REQUESTSpC] are each initialized 
to a constant. According to a preferred embodiment, the variables are initialized to 
zero. However, another constant may be used to avoid the potential problem of 
SO dividing by zero. 
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Once the distributed director is configured and initialization is performed, a 
server request may then be received at step 30. By v^ay of example, the server request 
may include a DNS host name query that is received from a client. A DNS host name 
query typically includes a host name to be looked up and an indication that an IP 
address is requested. 

After the server request is received, the portion metrics may be utilized as well 
as other metrics to select an appropriate server in response to each server request. 
According to one embodiment, each server request is distributed in accordance with 
the Domain Name System (DNS). At step 32, a plurality of IP addresses associated 
with the queried host name are obtained. Each one of the plurality of IP addresses are 
associated with one of a plurality of servers in the distributed system. In this manner, 
a set of servers are obtained from which one is to be selected. At step 34, the portion 
metrics for each one of the plurality of servers are added to obtain a total portion 
metric, TOTAL_PORTION. 

In order to distribute the server request to an appropriate server, metric values 
for each one of the plurality of servers are computed. A metric value for a selected 
server X is computed using the number of server requests distributed to the selected 
server, the portion metric for the selected server, the total portion metric, and the total 
number of server requests processed by the distributed system where 
METRIC_VALUE[X] = SERVER_REQUEST_PERCENTAGE[X] - 
PORTION_METRIC[X]. At step 36, a server request percentage is calculated for 
each one of the plurality of servers. For each server, the server request percentage is 
calculated using the number of server requests distributed to the server, the total 
portion metric, and the total number of server requests received where 
SERVER_REQUEST_PERCENTAGEPC] = NUMBER_SERVER_REQUESTSpC] 
* TOTAL_PORTION / TOTAL_REQUESTS. Thus, the server request percentage is 
a percentage of total requests distributed to a server X which is normalized to the 
portion metric assignments. Next, at step 38, a metric value is determined for each 
one of the plurality of servers. For each server, the metric value is determined using 
the server request percentage for the server, the portion metric for the server, and the 
total number of server requests received. According to one embodiment, the metric 
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value is obtained by subtracting the portion metric from the server request percentage, 
METRIC_VALUEPC] = SERVER__REQUEST_PERCENTAGE[X] - 
PORTION METRICPC], However, a constant such as TOTAL_REQUESTS may be 
added to this value to ensure that the METRIC_VALUEpC] is a positive number. 

5 If multiple metrics have been specified for each server, the metrics may be 

combined to form a single metric value prior to server selection. In addition, each 
metric or selected metrics may be assigned a weight during configuration indicating a 
metric priority. If multiple metrics have the same metric priority, the metrics may be 
C3 added to obtain a composite metric. For example, if two metrics have the same 

Q 10 metric value, the metric values may each be multiplied by their respective weight 
5^1 { values (if specified) and then added together to form the composite metric for the 

^0 corresponding server. According to one embodiment, the default weight values are 

M set to 1 . 

;^ The metric value for each one of the plurality of servers are then compared to 

S3 1 5 choose one or more "selected" servers (e.g., a server having a lowest or highest metric 
Q value) at step 40. In the embodiment described below, it is assumed that the server 

having the lowest metric value is selected. Moreover, a set of the plurality of servers 
having metric values within the tolerance range may be "selected". 

If multiple servers are determined to have been "selected" at step 42 (the 
20 metric values associated with the selected servers are within the tolerance range), an 
alternate metric may be used as a "tie breaker" at step 44. By way of example, the 
server closest in distance to the client may be selected. As yet another example, the 
next metric may be used to select the "best" server. 

Next, at step 46, the IP address associated with the selected server is provided 
25 to the client from which the server request was obtained. By way of example, the IP 
address may be returned to the client's local DNS. Therefore, the server request is 
distributed using the portion metric assigned to each one of the plurality of servers, 
the number of server requests distributed to each one of the plurality of servers, and 
the total number of server requests as described above. As a result, the distributed 
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director effectively functions as a DNS server. In this maimer, a server request may 
be distributed in accordance with the DNS. 

The tolerance range specified for each port ultimately determines the 
frequency that the portion metric is used to allocate server requests. By way of 
example, the first metric to be applied may be the portion metric and the second 
metric to be applied may be the distance metric. If the tolerance range for the portion 
metric is set to 20 percent, all servers having a metric value within 20 percent of the 
optimum metric value (e.g., lowest or highest metric value) will be considered to be 
equally desirable. In this case, the second distance metric is applied as a "tie 
^ 10 breaker". Accordingly, the second distance metric will be used in all instances where 
LJ two or more servers are within the tolerance range of an optimum metric value. As a 

% result, the portion metric is used only in instances where only one server falls within 

this tolerance range. In other words, the portion metric will ultimately be used only to 
C3 correct load imbalances according to the configured tolerance range, while a second 

□ 1 5 metric (e.g., distance metric) wdll be applied when there is no load imbalance, 
i y 

J3 As described above, the frequency that the portion metric is applied may 

provide an indication of load imbalance. Monitoring the application of the portion 
metric will therefore indicate whether the load apportionment among the servers is 
appropriate. Accordingly, a counter may be incremented each time the portion metric 
20 is applied. This counter value may then be used to correct any load imbalance present 
in the distributed system. 

Since the application of the portion metric (e.g., one server is selected 
according to the portion metric) will indicate the presence of a load imbalance within 
the distributed system, a report may be generated each time the portion metric is 
25 applied. The report may include information designed to assist in correcting load 

imbalance. By way of example, the report may include the number of server requests 
distributed to each server. Thus, the report will fiimish the necessary information to 
allow a system administrator to reconfigure selected portion metrics and tolerance 
ranges, as well as modify the allocation of the servers. By way of example, the 
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geographical distribution of the servers may be modified. As yet another example, 
additional servers may be added to a particular geographical location. 

Once the server request has been distributed to a server, all variables are 
updated to reflect the receipt of a new server request as well as the allocation of the 
5 new server request to the selected server. At step 48, the number of server requests 
distributed to the selected server, 

NUMBER_SERVER_REQUESTS[SELECTED_SERVER] is increased. In this 
manner, the number of server requests distributed to each one of the plurality of 
£□ servers is maintained. According to one embodiment, the number of server requests 

Si 10 distributed to the selected server is increased in response to step 46. Similarly, at step 
T,l 50 the total number of server requests processed by the plurality of servers, 

^0 TOTAL_REQUESTS, is incremented. 

5 The above steps may be modified or performed in an alternate order without 

departing from the scope of the invention. For example, according to a presently 
SH 1 5 preferred embodiment, the total number of server requests processed by the plurality 
P of servers is incremented in response to step 46. Accordingly, the total number of 

server requests is incremented by one. Alternatively, the total number of server 
requests may be computed as a sum of the number of server requests distributed to 
each one of the plurality of servers. As another example, the total number of server 
20 requests may be incremented in response to step 29. This would eliminate the 

possibility of a "divide by zero" error since the total number of server requests would 
be incremented prior to step 36. 

Referring now to FIG. 4, a flow diagram illustrating a method for calculating 
the server request percentage for each of the plurality of servers 36 shown in FIG. 2 

25 according to an embodiment of the present invention is shown. At step 52, a first one 
of the plurality of servers is selected. For example, if the server request percentage 
for each of the plurality of servers is stored in an array, each one of the plurality of 
servers may be selected according to an index, X. As shown, the first index in such 
an array may be one. At step 54, the number of server requests distributed to the 

30 selected one of the plurality of servers and the total portion metric are multiplied to 
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obtain a product. Next, at step 56, the product is divided by the total number of 
server requests received to obtain the server request percentage for the selected one of 
the plurality of servers. According to a preferred embodiment, in order to avoid 
dividing by zero, it is determined if the total number of server requests is equal to 

5 zero. If the total nimiber of server requests is equal to zero, the server request 

percentage for the one of the plvirality of servers is initialized to zero. Next, at step 
58, it is determined whether the server request percentage for each of the plurality of 
servers has been calculated. For example, if the index, X, is equal to the nimiber of 
servers N, all necessary calculations have been performed. However, if the index, X, 

1 0 is less than the nvimber of servers, the server request percentage must be calculated for 
the remaining servers. At step 58, if the server request percentage has not been 
calculated for each of the servers, the index, X, is incremented at step 60, and the 
method continues at step 54. 

Referring now to FIG. 5, a flow diagram illustrating a method for determining 
1 5 the metric value for each of the plurality of servers 3 8 shown in FIG. 2 according to 
one embodiment of the invention is illustrated. At step 62, a first one of the plurality 
of servers is selected. As described above, a server may be selected according to an 
index, X. At step 64, the portion metric associated with the selected one of the 
plurality of servers, PORTION_METRICpC], is subtracted from the server request 
20 percentage for the selected one of the plurality of servers, 

SERVER_REQUEST_PERCENTAGEPC], to obtain a metric value for the selected 
one of the plurality of servers, METRIC_VALUEpC]. Thus, METRIC_VALUE[X] = 
SERVER_REQUEST_PERCENTAGE[X] - PORTION_METRICpC]). In addition, 
as described above, a constant such as TOTAL_REQUESTS may be added to this 
25 value to ensure that the METRIC_VALUE[X] is a positive number. Next, at step 66, 
it is determined whether the metric value for each of the plurality of servers has been 
calculated. For example, if the index, X, is equal to the number of servers N, all 
necessary calculations have been performed. However, if the index, X, is less than 
the number of servers, the metric value must be calculated for the remaining servers. 
30 At step 66, if the metric value has not been calculated for each of the servers, the 
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index, X, is incremented at step 68, and the method loops at step 64 imtil the metric 
value for each server has been calculated. 

According to another embodiment of the invention, the distributed director 
operates in HTTP redirect mode in accordance with the HTTP protocol. Referring 

5 now to FIG. 6, a method for operating the distributed director in HTTP redirect mode 
84 is presented. The distributed director is configured at step 86. Configuration may 
include associating each host with a plurality of IP addresses corresponding to a 
plurality of web servers in the distributed system, assigning a portion metric to each 
server, assigning other metrics to each server, and specifying a tolerance range for 

10 application of each specified metric as described above in step 28. Next, at step 88, 
an HTTP connection is accepted from a client. Thus, the distributed director operates 
as though it were a web server. The distributed director has an IP address as well as a 
different IP address used by a web redirector. As a resuU, the client may 
communicate with the web redirector by connecting to the web redirector' s IP 

1 5 address. The web redirector may then redirect connections through the use of virtual 
web servers. By way of example, a different host (and IP address) is associated with 
each set of virtual servers. The client is then connected to one of a set of virtual web 
servers associated with the distributed director at step 89. The client's IP address is 
then obtained at step 90 since it is later used to send an HTTP redirect. A host name 

20 (e.g., www.cisco.com) associated with the IP address of the virtual web server 

connected to is then determined at step 92. This may be performed via a DNS server, 
as described above. 

Once the client is connected to the virtual web server, it is desirable to obtain a 
set of IP addresses from which to select an IP address and send an HTTP code 

25 redirect. One problem which may occur when a set of IP addresses associated with 
the host name of the virtual server is obtained is that the IP address of the virtual web 
server connected to may ultimately be selected. Since it would be extremely 
undesirable to select the virtual web server that the client is connected to, a new host 
name different from the host name of the virtual server connected to may be 

30 constructed. At step 93, a new host name associated with the host name determined 
in step 92 is constructed. By way of example, a string may be appended to the host 
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name of the virtual server that the cUent is connected to. A set of IP addresses of real 
web servers associated with the new host name are then obtained at step 94. By way 
of example, the real web servers may be associated with the new host name in a DNS 
server. In this manner, the IP address of the virtual web server connected to may be 
excluded, or filtered, fi-om selection. 

Once the set of IP addresses are obtained, an IP address may be selected. As 
shown in FIG. 6, one of the set of IP addresses may be selected according to steps 32 
through 44 described above. At that time, the client's IP address may be used in the 
selection process depending on the metric(s) applied. By way of example, the 
distance metric may use the client's IP address to ascertain the distance between a 
server and the client. The distributed director then sends to the client the HTTP status 
code "302 temporarily moved" with a URL for the selected server associated with the 
selected IP address at step 96. The client may then transparently connect to the 
selected web server specified by this URL. System variables may then be updated to 
reflect the server selection according to steps 48 through 50 described above. In this 
manner, a client may communicate with the server to which it has been redirected 
transparently through the distributed director web server. Accordingly, information 
provided to a client may be selectively varied according to the particular client 
requesting the information. 

The present invention redirects clients to an appropriate server according to 
one or more metrics. This eliminates the need for end users to choose a server from a 
list of possible sites. In addition, since the distributed director localizes traffic to 
Intemet servers, end users benefit fi-om improved access performance and reduced 
transmission costs. 

The invention can also be embodied as computer readable code on a computer 
readable medium. The computer readable medium is any data storage device that can 
store data which can thereafter be read by a computer system. Examples of the 
computer readable medium include read-only memory, random-access memory, CD- 
ROMs, magnetic tape, and optical data storage devices. The computer readable 
medium can also be distributed over a distributed system between coupled computer 
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systems so that the computer readable code is stored and executed in a distributed 
fashion. 

Although illustrative embodiments and applications of this invention are 
shown and described herein, many variations and modifications are possible which 
remain within the concept, scope, and spirit of the invention, and these variations 
would become clear to those of ordinary skill in the art after perusal of this 
application. For instance, the present invention is described as being implemented 
and capable of operating in DNS mode or HTTP redirect mode. However, it should 
be understood that the invention is not limited to such implementations, but instead 
would equally apply regardless of the context and system in which it is implemented. 
Thus, broadly speaking, the operations described above may be used to distribute 
traffic in a distributed system based on predefined server metrics. In addition, the 
above described invention may be stored on a disk drive, a hard drive, a floppy disk, a 
server computer, or a remotely networked computer. Accordingly, the present 
embodiments are to be considered as illustrative and not restrictive, and the invention 
is not to be limited to the details given herein, but may be modified within the scope 
and equivalents of the appended claims. 



CISCP048C 1 /LVP/ERH 



Page 19 



